Identifying disgruntled employee systems fraud risk through text mining: A simple solution for a multi-billion dollar problem
نویسنده
چکیده
a r t i c l e i n f o Occupational fraud is a $652 billion problem to which disgruntled employees are a major contributor. Much security research addresses reducing fraud opportunity and increasing fraud detection, but detecting motivational factors like employee disgruntlement is less studied. The Sarbanes–Oxley Act requires that companies archive email, creating an untapped resource for deterring fraud. Herein, protocols to identify disgruntled communications are developed. Messages cluster well according to disgruntled content, giving confidence in the value of email for this task. A highly accurate naïve Bayes model predicts whether messages contain disgruntled communications, providing extremely relevant information not otherwise likely to be revealed in a fraud audit. The model can be incorporated into fraud risk analysis systems to improve their ability to detect and deter fraud. The Sarbanes–Oxley Act [36] was created in the wake of a series of prominent financial scandals to protect investors from their recurrence. The Act's provisions endeavor to reveal and prevent corporate fraud. Rules issued by the Securities and Exchange Commission (SEC) to enforce the Act are being construed to require all public companies to store every document that influences financial reporting, including all email messages sent and received, for a number of years [5,28,38,47]. Managing the huge volumes of text employees create every day has been called the biggest challenge for companies seeking Sarbanes–Oxley compliance [34]. Industry-specific mandates such as Securities and Exchange Commission rules for brokers and traders, Medicare requirements for healthcare companies, and many other regulations pose their own email retention requirements [45,53]. The Sarbanes Oxley Act's focus is financial reporting and certification as a fraud deterrent, or failing that, to enable fraud discovery [9]. Legislative and regulatory requirements to store email, along with techniques for analyzing unstructured text data, create a less obvious path for fraud deterrence and detection: finding non-financial predictors and indicators of fraud risk or actual fraud in employees' email communications. A national survey reports that 75% of organizations experienced fraud in the three months prior to the study, with employee fraud being the most prevalent [15]. Occupational fraud losses to companies in the United States are estimated to be around $652 billion per year, equivalent to an average of about 5% of total corporate revenues and a far greater share of profit [33]. Globally, the average fraud loss per company in the 2004–2007 period is estimated to be $8.2 million [25]. …
منابع مشابه
Interactive Rule Refinement for Fraud Detection
Credit card frauds are unauthorized transactions that are made or attempted by a person or an organization that is not authorized by the card holders. Fraud with general-purpose cards (credit, debit cards etc.) is a billion dollar industry and companies are therefore investing significant efforts in identifying and preventing them. It is typical to deploy mining and machine learning-based techn...
متن کاملA Critical Analysis of Financial Fraud Spam in English in Terms of Persuasive Strategies: Personalization, Presupposition, and Lexical Choices
The term ‘spam’ addresses unsolicited emails sent in bulk; therefore, the term‘financial fraud spam’ refers to unwanted bulk emails in which different tricks and techniques areemployed to swindle money from the recipients. Estimates show that more than 80% of worldwideemail traffic in 2011 was spam. It should be noted that while the number of daily spam emails in2002 was 2.4 billion, this numbe...
متن کاملAutomating Vendor Fraud Detection in Enterprise Systems
Fraud is a multi-billion dollar industry that continues to grow annually. Many organizations are poorly prepared to prevent and detect fraud. Fraud detection strategies are intended to quickly and efficiently identify fraudulent activities that circumvent preventative measures. In this paper, we adopt a DesignScience methodological framework to develop a model for detection of vendor fraud base...
متن کاملPredicting Workers’ Compensation Insurance Fraud Using SAS Enterprise Miner 5.1 and SAS Text Miner
Insurance fraud costs the property and casualty insurance industry over 25 billion dollars (USD) annually. This paper addresses workers' compensation claim fraud. A data mining approach is adopted, and issues of data preparation are discussed. The focus is on building predictive models to score an open claim for a propensity to be fraudulent. A key component to modeling is the use of textual da...
متن کاملMulti-objective Solution Approaches for Employee Shift Scheduling Problems in Service Sectors (RESEARCH NOTE)
Today, workforce scheduling programs are being implemented in many production and service centers. These sectors can provide better quality products and/or services to their customers, taking into account employees’ desires and preferences in order to increase sector productivity. In this study, an employee shift scheduling problem in the service sector is discussed. In the problem, the aim is ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Decision Support Systems
دوره 46 شماره
صفحات -
تاریخ انتشار 2009